Basic Statistics

Raw Counts

Name Value
Rows 576,782
Columns 28
Discrete columns 6
Continuous columns 22
All missing columns 0
Missing observations 1,776,620
Complete Rows 460,860
Total observations 16,149,896
Memory allocation 117.4 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 3 columns ignored with more than 50 categories.
## PLAYER_NAME: 2294 categories
## COMMENT: 5330 categories
## MIN: 3383 categories

QQ Plot

## Warning: Removed 804 rows containing non-finite values (stat_qq).
## Warning: Removed 804 rows containing non-finite values (stat_qq_line).

## Warning: Removed 1206 rows containing non-finite values (stat_qq).
## Warning: Removed 1206 rows containing non-finite values (stat_qq_line).

## Warning: Removed 579 rows containing non-finite values (stat_qq).
## Warning: Removed 579 rows containing non-finite values (stat_qq_line).

Correlation Analysis

## 4 features with more than 20 categories ignored!
## TEAM_ABBREVIATION: 34 categories
## TEAM_CITY: 33 categories
## PLAYER_NAME: 1958 categories
## MIN: 3310 categories
## Warning in cor(x = structure(list(GAME_ID = c(21900895L, 21900895L, 21900895L, :
## the standard deviation is zero

Principal Component Analysis

## 2 features with more than 50 categories ignored!
## PLAYER_NAME: 1958 categories
## MIN: 3310 categories
## Warning in plot_prcomp(data = structure(list(GAME_ID = c(21900895L, 21900895L, : The following features are dropped due to zero variance:
##  * COMMENT_